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A BOUND FOR DICKSON’S LEMMA 


JOSEF BERGER AND HELMUT SCHWICHTENBERG 

Abstract. We consider a special case of Dickson’s lemma: for any two 
functions /, g on the natural numbers there are two numbers i < j such 
that both / and g weakly increase on them, i.e., fi < fj and gi < gj. By 
a combinatorial argument (due to the first author) a simple bound for 
such i,j is constructed. The combinatorics is based on the finite pigeon 
hole principle and results in a descent lemma. From the descent lemma 
one can prove Dickson’s lemma, then guess what the bound might be, 
and verify it by an appropriate proof. We also extract (via realizability) 
a bound from (a formalization of) our proof of the descent lemma. 

Keywords: Dickson’s lemma, finite pigeon hole principle, program 
extraction from proofs, non-computational quantifiers. 


1. Introduction 

Consider the following special case of Dickson’s lemma: for any two func¬ 
tions /, g on the natural numbers there are two numbers i < j such that 
both / and g weakly increase on them, i.e., fi < fj and gi < gj. By a com¬ 
binatorial argument (due to the first author) a simple bound for such i,j is 
constructed. The combinatorics is based on the finite pigeon hole principle 
and results in a certain descent lemma. From the descent lemma one can 
prove Dickson’s lemma, then directly guess what the bound might be, and 
hnally verify it by an appropriate proof. We also extract (via realizability) 
a bound from (a formalization of) our proof of the descent lemma. 

In its usual formulation, Dickson’s lemma (for fixed functions) is a S^- 
formula. In contrast, we shall prove a quantifier-free statement which im¬ 
plies Dickson’s lemma in its usual form, but not vice versa. Our proof can 
be carried out in the formal system of Elementary Analysis [?, p.l44], a 
conservative extension of Heyting arithmetic with variables and quantifiers 
for number-theoretic functions. In fact, we don’t make use of the axiom 
of choice at all. Furthermore, we can restrict induction to quantifier-free 
formulas. 
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Dickson’s lemma has many applications. For instance, it is used to prove 
termination of Buchberger’s algorithm for computing Grobner bases [?], and 
to prove Hilbert’s basis theorem [?]. 

There are many other proofs of Dickson’s lemma in the literature, both 
with and without usage of non-constructive (or “classical”) arguments. The 
original proof of Dickson [?] and the particularly nice one by Nash-Williams 
[?] (using minimal bad sequences) are non-constructive, and hence do not 
immediately provide a bound. But it is well known that by using some 
logical machinery one can still read off bounds, using either Godel’s [?] 
Dialectica translation as in Hertz [?] or Friedman’s [?] H-translation as in [?]. 
However, these bounds - even for the case of just two functions considered 
here - heavily use higher type (primitive recursive) functionals and are less 
perspicious than the one obtained below. 

The first constructive proof of Dickson’s lemma has been given by Schiitte 
and Simpson [?, ?], using ordinal numbers and transfinite induction up to 
cq. Similar methods have been used by Sustik [?] and Martm-Mateos et 
al. [?]. Since initial segments of transfinite induction are used, these proofs 
when written in arithmetical systems require ordinary induction on quanti¬ 
fied formulas. A different constructive proof has been given by Veldman [?]. 
It uses dependent choice for Ei-formulas (with parameters), and induction 
on n 2 -formulas. This proof also provides the basis of Fridlender’s [?] forma¬ 
lization in Agda. The computational content of these proofs has not been 
studied; the bound involved will be very different from the present one. 

2. A COMBINATORIAL PROOF OF DiCKSON’S LEMMA 

We start with a finite pigeonhole principle, in two disjunctive forms. The 
(rather trivial) proofs are carried out because they have computational con¬ 
tent which will influence the term extracted from a formalization of our 
proofs in Section 3. 

Lemma 2.1 (FPHDisj). = fj V 3j<mm < fj). 

Proof. By induction on m. For m = 0 the second alternative holds. In 
case m -|- I let fj be maximal among /o,..., fm+i- If w- + 1 < /j we are 
done. Else we have fj < m. Now we apply the induction hypothesis to 
f' := /o,..., fj-i, fj+i, ■ ■ ■, fm+i- If two of them are equal we are done. 
Else m < fk for some /c / j and hence fj < fk- If fj = fk we are done. Else 
we have fj < fk, contradicting the choice of j. □ 

Note that quantifier-free induction suffices here, since we only prove a 
property of finite lists of natural numbers. 
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In the key lemma 2.3 below we will need a somewhat stronger disjunctive 
version of the pigeonhole principle. To this end we need an injective coding 
{n,m) of natural numbers which is “square-filling”, i.e. with the property 

(1) < {n,m) ^ k < ny k < m. 

This can be achieved by 


or explicitely 


12 13 14 15 

6 7 8 11 

2 3 5 10 

0 14 9 


(n, m) 


TT? + m 
m? + m + n 


if m < n, 
otherwise. 


Lemma 2.2 (FPHDisj2). 


f,g,k{^i<:j<k^{fi — fj ^ 9i — 9j) V < fj V < Qj). 

Proof. Fix f,g,k. Use Lemma 2.1 with Si := {fi,gi) and m := k‘^. In the 
first case from Sj = Sj we obtain fi = fj and gi = gj by the injectivity of 
the coding. In the second case we have some j < k'^ with /c^ < Sj. From the 
square-filling property (1) of the coding we obtain k < fj or k < gj. □ 


As an immediate consequence we have 

Lemma 2.3 (Key). 

f,g,n,k{^n<i<j<n+k^+l{fi ~ fj ^ 9i ~ 9j) 

^n<j<n+k‘^+lk — fj ^n<j<n+k^+l^ — 9j)- 
Proof. Use Lemma 2.2 for Xifn+i+i, Xign+i+i and k. □ 


Now we introduce some notation. Mini(/, n) is the first argument where 
/ is minimal on {0,..., n}: 

Mini(/,0) := 0, 

Mi„i(/. „+!):= 

n -|- 1 otherwise. 
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We define functions and a formula D with arguments f,g,n. For 

readability f,g are omitted. 

• /Mini(/,n) “1“ 9Mim{g,n)i 
I{n) ;= n + ^(n)^ + 1, 

D{ri) '.= ^i<^j<n{fi ^ fj ^ 9i — 9j)‘ 

D{n) expresses that n is a bound for Dickson’s lemma. 

The next lemma states a crucial property of the function I: either I{n) 
already is a bound for Dickson’s lemma, or else decreases properly when 
going from n to I{n). Since this cannot happen infinitely often, iteration of 
I will finally give us the desired bound. 


Lemma 2.4 (Descent). D{I{n)) V ^(/(n)) < ‘h(n). 

Proof. Use Lemma 2.3 with /, g, n and 'k(n). In the first case we have 
D{I(n)). In the second case we have n < j < I{n) with T(n) < fj; the 
third case is symmetric. Let i := Mini(g', n). Then /j < 'l'(n). In case 
9i < 9j we have D{I{n)) and are done. Therefore assume gj < gi. We show 
(i) <I>(/(n)) < $(j) and (ii) <h(j) < <I>(n). From j < I{n) we obtain (i). 
For (ii) we show fuimifj) + 9Mim(g,j) < /Mini(/,n) + 9i- Now n < j implies 
/Mini(/,j) — /Mim(/,n)) &Ild 9Mini{g,j) — 9j ^ 9i- ^ 

From Lemma 2.4 we construct a bound for Dickson’s lemma. Let 
/°(n) := n, := /(/™(n)). 

Lemma 2.5. D{I^{0)) V <h(/"'(0)) + n < <h(0). 

Proof. Induction on n. Step n i-a n + 1. Applying Lemma 2.4 to /”(0) gives 
^(jn+i( 0 )) v $(/"■+! (0)) < <h(/”(0)). In the second case we have 

$(/”+^(0)) + n + 1 < $(/"(0)) + n + 1 < $(0) + I 

The latter inequality follows from the induction hypothesis, since D(/”(0)) 
implies D(/"'+^(0)). □ 

Proposition 2.6. 

Proof. Apply Lemma 2.5 to <I>(0) + 1. □ 


This bound is far from optimal: already for 


fn ■■= 


1 if n = 0, 
0 else 


9n ■= 0 
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with optimal bound 2 we have 

j/o+<?o+i(o) = /2(o) = /(/(O)) > /(O) = ^(0)2 + 1 = 2. 

Can we extend this proof to show Dickson’s lemma for finitely many 
functions? For instance for three functions a corresponding version of the 
key lemma holds; 

f,g,h,n,ki^n<i<j<n+k'^+l{fi ~ fj ^ 9i ~ 9j ^hi = hj) V 

^n<j<n+k^+l^ — ^n<j<n+D+l^ — 9j ^n<j<n+k‘^+l^ — 

(Proof. Apply the original key lemma to {f,g),h,n and /c^). We can also 
define a measure function $(n) ;= /Mini(/,n)+ffMini(g,n)+^Mini+,n)- A natural 
candidate for is 

'k(Tl) . , /Mini(/i,n)) 9Mim{f,n )) 9Mim{h,n) ) ^Mini(/,n)) ^Mini(g,n)} 

and a natural candidate for I is I{n) := n+'I'(n)^ + l. But the corresponding 
version of the descent lemma is false: let n := 2 and 

/;=(0,l,l,l,0,/5,...), 

9 ■= (1,0,1,0,1,55,...), 
h := (l,l,0,0,0,/r5,...). 

Then <l>(n) = 0, 'k(n) = 1, /(n) = 4, and we neither have D{I{n)) nor 
$(/(n)) < <k(n). ~ However, it may well be that a more refined form of the 
present approach works. We leave this for future research. 


3. Extracting computational content 

In the following, we demonstrate how a bound for Dickson’s lemma can 
be extracted from a proof of the existence of such a bound. The proof we 
will use is essentially the one presented in Section 2, i.e., it is based on the 
descent lemma 2.4. We will then apply the realizability interpretation to 
obtain the bound. In fact, the bound will be machine extracted from a 
formalization of the existence proof. 

In more detail, we shall use that I is increasing (i.e., n < I{n)) and that 
from D{n) and n < m we can infer D{m). Then we prove the existence of 
a bound by general induction with measure <k. 
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3.1. General induction and recursion. We first explain general induc¬ 
tion w.r.t. a measure, and the corresponding definition principle of general 
recursion. 

General induction allows recurrence to all points “strictly below” the 
present one. In applications it is best to make the necessary comparisons 
w.r.t. a measure function n; for simplicity we restrict ourselves to the case 
where fx has values in the natural numbers, and the ordering we refer to is 
the standard <-relation. The principle of general induction then is 

V;,,a;(Prog^Px Px), 

where Prog^Px expresses “progressiveness” w.r.t. jj, and <, i.e., 

Prog^Px := Va;(Vy(/i?/ < fxx ^ Py) —> Px). 

It is easy to see that in our special case of the <-relation we can prove 
general induction from structural induction. However, it will be convenient 
to use general induction as a primitive axiom, for then the more efficient 
general recursion constant P will be extracted. It is defined by 

PfxxG = Gx{\y[iifxy < yx then PyyG else e]), 

where e denotes a canonical inhabitant of the range. It is easy to prove that 
P is definable from an appropriate structural recursion operator. 

3.2. Non-computational quantifiers. We now use general induction in 
our constructive proof of Dickson’s lemma. However, we have to be careful 
with the precise formulation of what we want to prove. We are not interested 
in the pair i, j of numbers where both / and g increase, but only in a bound 
telling us when at the latest this must have happened. Therefore the existen¬ 
tial quantifiers must be made “uniform” (i.e., non-computational); it will 
be disregarded in the realizability interpretation. Such non-computational 
quantifiers have first been introduced in [?, ?]; in [?] this concept is extended 
to all connectives and discussed in detail. Let 

D'{n) := < fj A Qi < gj). 

Using this non-computational form of D{n) we modify Lemma 2.4 to 

Lemma 3.1 (Descent’^'’). D'{I{n)) V <I>(/(n)) < <k(n). 

Note that the computational content of a proof of this lemma is that of a 
functional mapping two unary functions and a number into a boolean. From 
Lemma 3.1 we obtain as before a modification of Proposition 2.6 to 
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Proposition 3.2 (Bound for Dickson’s lemma). 

V/,g,n3fc(/(n) <kA D'{k)). 

Proof. By general induction with measure function <1>. We fix f,g and prove 
progressiveness of the remaining Vndfc-formula. Therefore we can assume as 
induction hypothesis that for all m with d>(m) < <l>(n) we have 

3fc(/(m) < k A D'{k)). 

We must show 

3fc(/(n) < k A D'{k)). 

By Lemma 3.1 we know D'{I{n)) V <l>(/(n)) < <l>(n). In the first case we 
have D'{I{n)) and can take k := I{n). In the second case we apply the 
induction hypothesis to I{n). It provides a k with I{I{n)) < k and D'{k). 
But I{n) < I{I{n)) since n < I{n). □ 

3.3. Formalization and extraction. The formalization^ (in Minlog^) of 
the proof above is now routine. The term extracted from it is 
[f,g,n](GRecGuard nat nat)(Phi f g)n 
([nO,fl] [if (cDesc f g nO) (I f g nO) (fl(I f g nO))]) 

True 

To explain this term we rewrite it in the notation above 
with measure g and step function G defined by 

/i := 

if cDesc(n), i.e., D'{I{n)), 

’ ]h{I{n)) otherwise, i.e., <h(/(n)) < I{n), 

where for readability we again omit the arguments f,g from <!>,/, cDesc. 
The functions d>,/ are defined as in (2), and cDesc is the computational 
content of Lemma 3.1: 

[f,g,n][case (cKey f g n(f(Mini g n)max g(Mini f n))) 

((DummyL nat ysum nat) -> True) 

(Inr nn -> 

[case nn 

((InL nat nat)nO -> 

(cNatLeLtCases boole)(g(Mini g n))(g nO)True False) 


^Available at git/minlog/examples/arith/dickson. scm 
See http://www.minlog-system.de 
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((InR nat nat)nO -> 

(cNatLeLtCases boole)(f(Mini f n))(f nO)True False)])] 
Here nn is a variable of type N + N with N the type of natural numbers, 
and cNatLeLtCases: 

(Rec nat=>nat=>alpha=>alpha=>alpha)n 
([nO,x,xO][case nO (0 -> xO) (Succ nl -> x)]) 

([n0,h,nl,x,x0][case nl (0 -> x) (Succ n2 -> h n2 x xO)]) 
is the computational content of the (simple) proof of 

Vn,m((R- < m —> P) ^ (m < n —> P) —> P) 

expressing case distinction w.r.t. < and <. 

cKey is the computational content of Lemma 2.3: 

[f ,g,n,n0] 

[case (cFPHDisjTwo([nl]f(Succ(n+nl)))([nl]g(Succ(n+nl)))n0) 
((DummyL nat ysum nat) -> (DummyL nat ysum nat)) 

(Inr nn -> 

Inr[case nn 

((InL nat nat)nl -> (InL nat nat)(Succ(n+nl))) 

((InR nat nat)nl -> (InR nat nat)(Succ(n+nl)))])] 
which uses cFPHDisjTwo: 

[f,g,n][if (cFPHDisj (n*n) 

([nO] [if (g n0<f nO) 

(f n0*f nO+g nO) 

(g n0*g nO+g nO+f nO)])) 

([ij](DummyL nat ysum nat)) 

([nO] 

Inr[if (cCodeSqFilKf nO) (g n0)n) 

((InL nat nat)n0) 

((InR nat nat)n0)])] 
which in turn depends on cCodeSqFill: 

[n,n0,nl](Rec nat=>nat=>boole)n([n2]False) 

([n2,(nat=>boole),n3] 

[case n3 (0 -> True) (Succ n -> (nat=>boole)n)]) 
nO 

and cFPHDisj: 

[n](Rec nat=>(nat=>nat)=>nat@@nat ysum nat)n 
([f](InR nat nat@@nat)0) 

([n0,d,f] 
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[let nl 

[if (f(Succ nO)<=f(Maxi f nO)) (Maxi f nO) (Succ nO)] 

[if (Succ nO<=f nl) 

((InR nat nat@@nat)nl) 

[if (d([n2][if (n2<nl) (f n2) (f(Succ n2))])) 

([ij] 

(InL natSSnat nat) 

[if (right ij<nl) 

ij 

([if (left ij<nl) 

(left ij) 

(Succ left ij)]@Succ right ij)]) 

([n2] 

[if (n2<nl) 

((cNatLeCases nat@@nat ysum nat)(f n2)(f nl) 

((InL natOOnat nat)(0@0)) 

((InL natOOnat nat)(n2@nl))) 

((cNatLeCases nat@@nat ysum nat)(f(Succ n2))(f nl) 
((InL natOOnat nat)(000)) 

((InL natOOnat nat)(nlOSucc n2)))])]]]) 

To summarize, we have extracted a function which takes two functions 
/, g (suppressed for readability) and a number n and yields a bound. Notice 
that already with n = 0 we obtain the desired bound for Dickson’s lemma. 
However, the inductive argument requires the general formulation. 

Our extracted bound B{n) := F^nG satisfies 


B{n) = F^nG = Gn(Am[if ‘bm < <hn then F^mG else e]) 


I{n) if D'{I{n)), 
B{I{n)) if <I>(/(n)) < I{n). 


by Lemma 3.1, which also guarantees termination: B{n) will call itself at 
most I(n) times. As long as the iterations I{n), l‘^{n), ..., I^{n) decrease 
w.r.t. the measure d>, the next iteration step is done. However, as soon 
as Lemma 3.1 goes to its “left” alternative (i.e., D'(/(n)) holds), I(n) is 
returned. Hence this extracted bound differs from the “guessed” one in 
Proposition 2.6 in that it does not iterate I a prescribed number of times 
(/o + ffo + 1 many) at 0, but stops when allowed to do so by the outcome of 
Lemma 3.1. 



